|
|
Accession Number |
TCMCG042C52370 |
gbkey |
CDS |
Protein Id |
XP_016484506.1 |
Location |
join(30231..31113,31706..31799,31933..32035,32119..32172,32305..32379,32679..32747,32862..32945,33044..33235,34151..34701,34775..34904,35786..35877,35969..36147,36218..36420,36510..36611,37294..37456,38472..38761,38833..38970,39450..39529,39657..39756,40006..40135,40359..40591) |
Gene |
LOC107805046 |
GeneID |
107805046 |
Organism |
Nicotiana tabacum |
|
|
Length |
1314aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA319578 |
db_source |
XM_016629020.1
|
Definition |
PREDICTED: DNA mismatch repair protein MSH6-like [Nicotiana tabacum] |
CDS: ATGGGTTCTTCTCGCCGCTCCAGCAATGGCAGATCTCCGATCGTCAATCAACAGAGTCAAATCACTTCTTTCTTCTCTAAAATGACTTCGCCCTCACCTTCTCCTTCCCCATCACCTCTTGTCCCTAAAAAAATTCCAGTCAAATCTAACCCTAACCCTAACCCTAATGCTGAGCCTAAACTTAAATATAGTCCTAGTACCAGTCCTTGTGCTAGTCCTACGACACCTTCGCCTCTACAGGTGAAGCGGAAGATAACTGCACCTATTTCTGCCATTATTGACCTTAAGCCGTCGTACGGGCAAGAGATAGTGGGCAAAAGAGTTAAGGTCTACTGGCCATTGGATAGAACTTGGTACGAAGGCTGTGTAAAGTCTTTCGACGGTGTTTCTGGTGAACATTTGGTTAAGTACGACGATGGTGATGAGGAAATGATTGATTTAGCTGAAGAAAAGATCGAATTGGTAGTCGAGGCACCTGCGAGAAAGTTGAGGCGGTTGCGGAAATCTTTGGTGGTGGAAGAAGCTGAGGAGGAGGAGGAGGAAGAGAAATTGGAGGATTTGGAGAGCGTTGAAGATGATTCTGAGGATGAAGATTGGGGAAAAATTGCGGATAAACAAGTGTATGAAGACGAGGATGTTGATGAGGATATGGACTTGGTGGTTGAGGAAGAGAAAGATGATGCTGTTGGATCGAGAAGCAGGAAAGCGGGTGCGGATAAGGTGGTGGTGTCGAGGAAGCGGAAGAGTGGTGAAGGGGTGAAGTTAAGTTCGAGTTCGAGCAAGAAGAGTAAGACTCTTGCAGATAAGAAGAGTGCTAATAGCAAGGTGGACAATGCAGTGAATGGAGTAAATGGGAAAGAGCTTGTTAAAACCAATGAGGATTGTGTCAGGCCAACCAACAATGATAACGTACTGCTGTGCGGTGCAGCAGATAGATTCGGACAACGTGAAGCAGAGAAATTCCCTTTTGTTGCGAAAGATAGGAAGGACGCTAATAGGAGATCCCCTGGAGATGCCAATTATGATCCAAAGACTCTTTACCTACCTCCTAATTTTTTGAAAGGTTTAACTGGTGGTCAGAGACAATGGTGGGAGTTCAAGTCGAAGCACATGGATAAAGTTCTGTTTTTTAAGATGGGAAAGTTCTATGAGCTTTATGAGATGGATGCACATATTGGAACCAAGGAACTTCATTTGCAGTACATGAAGGGAGAACAACCCCATTGTGGATTTCCAGAAAAGAACTTCTCAATGAATGTAGAGAAGTTGGCGCGAAAGGGTTATAGGGTTCTTGTGGTTGAGCAAACAGAGACACCTGAACAGCTTGAGACTCGTCGAAGAGAGAAGGGATCTAAAGATAAGGTCGTCAGACGTGAAATATGTGCAGTGGTCACTAAAGGAACATTAACTGAGGGAGAAATGCTCGCAGCAAACCCTGATGCTTCATATATGATGGCAGTGACTGAAAGCTCTCAAACTGCTGTATTGCAAGGGAAGCGTACTTATGGTGTCTGTATGGTGGATATCACCACAAGCAAGGTTATTATTGGACAGTTTGAGGATGATTCAGATTGTAGTGCCTTGTGTTGTCTGCTTTCTGAGTTAAGACCAGTGGAAATAATAAAGCCAGCTAAATTGCTTAGTCTTGAGACTGAGAGAGTACTGCTGCGGTACACACGTAATCCGCTGGTAAATGAGTTGGTTCCTGTCTCTGAATTTTGGGATGCTGAGAGAACCATTTGTGAGGTGAAGGCAATCTATAGGAATATGAGCAGTCCACCGCTGACATCATCTCCAAATGAAATGGAATCACATGAAAGCACTACCTCAGAGGAATATGGTGAAAGGAACCTTCTACCAGATGTTTTATGTGAGCTTGTAAATCTTGGTAGGAATGGGAGTTATGCACTCTCAGCACTAGGAGGAGCTCTATACTACTTGAAGCAAGCTTTTCTGGACGAATCCCTGCTCAAATTTGCGAAATTTGAACCACTTCCCCTTTCTGGTTTTTGTGATAGTACTCAAAAACCGAATATGGCTCTTGATGCAGCTGCGCTTGAGAATCTTGAGATATTTGAGAACAGTCGAGATGGAGATTCTTCAGGGACATTATACGCTCAAATCAACCATTGTATCACAGCATTTGGGAAAAGGATGCTCAGGTCATGGCTTGCAAGACCCTTATATCATCCAGAGTCCATAAGAGAACGTCAGGATGCTGTAGCCGGATTAAAGGGGCTCAATCTACCTTTTGTTCTTGAGTTTAGAAAAGAGTTGTCAAGGCTTCCTGATATGGAACGGTTGCTTGCACGCCTCTTTGGTAGCAGTGAAGCAAATGGAAGAAATGCAAATAAAGTGATTTTATACGAGGATGCAGCAAAGAAACAACTGCAAGAGTTCGTATCTGCTTTACGTGGATGTGAATCAATGGTGCATGCATGCTCTTCACTTGGGGTGATCTTGGAAAACATGGATTCAAAGCTACTATATTATCTATTAACACCAGGTAAAGGTCTTCCAGATGTAGATTCAATTCTCAAGCATTTCAAGGATGCTTTTGATTGGGTAGAAGCAAATAACTCGGGCCGTATTATACCTCATGAGGGGGTTGATGAGGAGTATGATGCTGCATGTAAACAATTGCAGGAGATTGAACTTAAATTATCCAAGCACTTGAAGGAACAGAGGAAACTGCTTGGAGACTCATCAATAGACTACGTGACTGTAGGAAAAGATGCATACCTTTTGGAAGTACCAGAATGTTTGTGCAGGAGCATTCCGAAGGAGTACGAATTACAGTCATCGAAAAAGGGTTATTTCAGGTACTGGAATCCAGTCTTAAAGAAATTAATCGGAGAGCTCTCACAAGCTGATTCAGAGAAGGAATCTAAGCTAAAAAGTATTTTGCAGAGGTTGATAGGACGGTTTTGTGAACATCATAATAAGTGGAGAGAATTAGTTTGTATCACTGCAGAATTGGATGTTTTAATCAGTTTATCTATTGCGAGCGATTACTATGAGGGACCAACATGTCGTCCAAACATCAAGTCAGTGCCAAGTGAAGATGATGTGCCAGTTCTTCATGCTGAAAATTTAGGACATCCTGTTCTTAAAAGTGATTCTCTAGATAAGGGAGCTTTTGTTTCCAACAATGTTTCCCTTGGCGGTCCTCCGAACGCCAGCTTTATCCTTCTTACTGGTCCTAACATGGGAGGGAAATCCACTCTTTTGCGCCAAGTTTGCATGGCTGTAATTTTGGCCCAGATAGGAGCTGATGTACCAGCATCATCCTTTGACTTATCACCCGTCGATCGTATATTTGTAAGAATGGGGGCCAAAGATCATATTATGGCAGGCCAGAGTACATTCTTGACAGAACTCTTGGAAACTGCTTCAATGCTGTCTTTGGCGAGCCGTAATTCACTTGTCGCACTCGATGAACTTGGTCGCGGTACATCAACTTCCGATGGACAAGCAATAGCTGAATCAGTTCTTGAACACTTTGTCCACAAGGTGCAATGTCGAGGAATGTTTTCTACCCACTATCATCGATTATCTATTGACTATCAGAAAGATTCCAGAGTGTCACTGTGCCATATGGCATGCCAAGTTGGGAAAGGGTCCGGAGGTCTTGAGGAAGTTACTTTTCTATACAGGTTGACACCAGGTGCATGTCCTAAAAGTTATGGTGTCAATGTGGCACGGCTGGCTGGACTTCCTGATGGTGTGCTTCAGAGAGCTGCTGCTAAATCTGAAGAGTTTGAAATTAATGGTTACAATAAGCAATCTGAAGAGAACTCCTATGGGAATTTGACAAGAAAGACAGCAGCACTTGTGCAGAATTTGATGAATTTTATTATTGAAGAGAAATGTGACAATGGTGTGGTTCTTTGTGAGTTGAATGGATTGCAAAGGAGAGCAAGAATACTCCTTGAACAAAATTGA |
Protein: MGSSRRSSNGRSPIVNQQSQITSFFSKMTSPSPSPSPSPLVPKKIPVKSNPNPNPNAEPKLKYSPSTSPCASPTTPSPLQVKRKITAPISAIIDLKPSYGQEIVGKRVKVYWPLDRTWYEGCVKSFDGVSGEHLVKYDDGDEEMIDLAEEKIELVVEAPARKLRRLRKSLVVEEAEEEEEEEKLEDLESVEDDSEDEDWGKIADKQVYEDEDVDEDMDLVVEEEKDDAVGSRSRKAGADKVVVSRKRKSGEGVKLSSSSSKKSKTLADKKSANSKVDNAVNGVNGKELVKTNEDCVRPTNNDNVLLCGAADRFGQREAEKFPFVAKDRKDANRRSPGDANYDPKTLYLPPNFLKGLTGGQRQWWEFKSKHMDKVLFFKMGKFYELYEMDAHIGTKELHLQYMKGEQPHCGFPEKNFSMNVEKLARKGYRVLVVEQTETPEQLETRRREKGSKDKVVRREICAVVTKGTLTEGEMLAANPDASYMMAVTESSQTAVLQGKRTYGVCMVDITTSKVIIGQFEDDSDCSALCCLLSELRPVEIIKPAKLLSLETERVLLRYTRNPLVNELVPVSEFWDAERTICEVKAIYRNMSSPPLTSSPNEMESHESTTSEEYGERNLLPDVLCELVNLGRNGSYALSALGGALYYLKQAFLDESLLKFAKFEPLPLSGFCDSTQKPNMALDAAALENLEIFENSRDGDSSGTLYAQINHCITAFGKRMLRSWLARPLYHPESIRERQDAVAGLKGLNLPFVLEFRKELSRLPDMERLLARLFGSSEANGRNANKVILYEDAAKKQLQEFVSALRGCESMVHACSSLGVILENMDSKLLYYLLTPGKGLPDVDSILKHFKDAFDWVEANNSGRIIPHEGVDEEYDAACKQLQEIELKLSKHLKEQRKLLGDSSIDYVTVGKDAYLLEVPECLCRSIPKEYELQSSKKGYFRYWNPVLKKLIGELSQADSEKESKLKSILQRLIGRFCEHHNKWRELVCITAELDVLISLSIASDYYEGPTCRPNIKSVPSEDDVPVLHAENLGHPVLKSDSLDKGAFVSNNVSLGGPPNASFILLTGPNMGGKSTLLRQVCMAVILAQIGADVPASSFDLSPVDRIFVRMGAKDHIMAGQSTFLTELLETASMLSLASRNSLVALDELGRGTSTSDGQAIAESVLEHFVHKVQCRGMFSTHYHRLSIDYQKDSRVSLCHMACQVGKGSGGLEEVTFLYRLTPGACPKSYGVNVARLAGLPDGVLQRAAAKSEEFEINGYNKQSEENSYGNLTRKTAALVQNLMNFIIEEKCDNGVVLCELNGLQRRARILLEQN |